MRI and EEG Data


Data Overview

MRI Protocol
MRI Overview

EEG Protocol

Resting State
Sequence Learning Paradigm
WISC Symbol Search (Processing Speed)
Inhibition/Excitation Paradigm (Surround Suppression) 1
Visual Perception/Decision-Making Paradigm (Contrast Change) 1
Video 1
Visual Perception/Decision-Making Paradigm (Contrast Change) 2
Video 2
Visual Perception/Decision-Making Paradigm (Contrast Change) 3
Video 3
Inhibition/Excitation Paradigm (Surround Suppression) 2
Video 4 (The Present)

For EEG data interpretation and organization, click below to download organizational ReadMe files that describe event triggers, behavioral data labels, and EEG channel location.

EEG README - .zip file containing a general ReadMe file, a ReadMe file for each EEG paradigm, and a WISC Symbol Search key in both excel and .mat format.
EEG Channel Location - .sfp file containing the channel location of the EEG montage. More information about the EGI to 10-10 Layout Comparison can be found here and here.

Data Download

Download the Metadata File, which indicates whether participants completed phenotypic, MRI, and EEG sessions (0 = No, 1 = Yes), includes information on the number of runs or tasks completed, and provides additional metadata for each participant.
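As a rough illustration of how the 0/1 completion flags might be used, the sketch below selects participants who completed both the MRI and EEG sessions. The column names (`participant_id`, `Phenotypic`, `MRI`, `EEG`), the participant IDs, and the inline sample are assumptions made for this example; the real Metadata File may label its columns differently, so check its header first.

```python
import csv
import io

# Hypothetical excerpt of the Metadata File. The column names and IDs here
# are assumptions for illustration, not the file's documented layout.
SAMPLE_METADATA = """participant_id,Phenotypic,MRI,EEG
sub-0001,1,1,1
sub-0002,1,0,1
sub-0003,0,1,0
sub-0004,1,1,1
"""


def participants_with(rows, required_columns):
    """Return IDs of rows where every required column is flagged 1 (= Yes)."""
    return [
        row["participant_id"]
        for row in rows
        if all(row[col].strip() == "1" for col in required_columns)
    ]


rows = list(csv.DictReader(io.StringIO(SAMPLE_METADATA)))
both_modalities = participants_with(rows, ["MRI", "EEG"])
print(both_modalities)
```

To run this against a downloaded file, replace `io.StringIO(SAMPLE_METADATA)` with an open file handle and adjust the column names to match the actual header.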

Click on the EEG and MRI download links in the Download Table below.

non-BIDS data

Log in or register with the 1000 Functional Connectomes Project website on NITRC to gain access to the HBN neuroimaging datasets (if you are not already logged in or registered).
Use the checkboxes to select the subjects whose data you would like to download, or access the data through Amazon Web Services (AWS) and Cyberduck.

If you select BIDS data, you will be redirected to download the data in Brain Imaging Data Structure (BIDS) format. For more information about the HBN data in BIDS format, see the MRI and the EEG dedicated webpages.

Note that imaging data are released regardless of whether or not participants completed the remainder of the study. Therefore, some participants with imaging data may not have full phenotypic data available until a future release.

Note that data in AWS are not organized by release. For both EEG and MRI, BIDS and non-BIDS data follow slightly different release organizational structures. The release organization level is maintained for internal organizational purposes and for researchers who have downloaded previous data releases and would like to download only new data releases.

Note that differences in scanners and sequences across sites introduce significant technical variance that requires harmonization. Therefore, users may want to statistically harmonize neuroimaging data across data acquisition sites (see, for instance, Chen et al., 2022).

For information about preprocessed MRI data, please visit our Open Science Initiative page. Please note that the preprocessing pipeline for the available preprocessed EEG data uses project-specific parameters; we therefore strongly recommend preprocessing the raw EEG data independently.

Please note that the same participant can have EEG and MRI data released in different releases, and is then included in the Basic_Info File of each of those releases. Additionally, some participants have only EEG data available, while others have only MRI data available.

**Download Table**

Release Number (Release Date)	EEG non-BIDS	MRI non-BIDS	EEG BIDS	MRI BIDS
Release 1.1 (1/31/2018)	EEG (603 subjects)	MRI (616 subjects)	EEG (136 subjects)	MRI (2611 subjects)
Release 2.1 (1/31/2018)	EEG (156 subjects)	MRI (159 subjects)	EEG (154 subjects)
Release 3 (1/31/2018)	EEG (188 subjects)	MRI (202 subjects)	EEG (185 subjects)
Release 4 (7/31/2018)	EEG (359 subjects)	MRI (402 subjects)	EEG (324 subjects)
Release 5 (11/27/2018)	EEG (272 subjects)	MRI (205 subjects)	EEG (330 subjects)
Release 6 (02/15/2019)	EEG (213 subjects)	MRI (174 subjects)	EEG (135 subjects)
Release 7 (09/17/2019)	EEG (476 subjects)	MRI (438 subjects)	EEG (381 subjects)
Release 8 (01/03/2020)	EEG (361 subjects)	MRI (309 subjects)	EEG (257 subjects)
Release 9 (12/07/2020)	EEG (364 subjects)	MRI (243 subjects)	EEG (295 subjects)
Release 10 (04/13/2022)	EEG (576 subjects)	MRI (700 subjects)	EEG (533 subjects)	coming soon
Release 11 (11/23/2022)	EEG (572 subjects)	MRI (459 subjects)	EEG (430 subjects)	coming soon

Data Quality

Consistent with policies established through our prior data generation and sharing initiatives (i.e., FCP/INDI (Mennes et al. 2013); NKI-Rockland Sample (Nooner et al. 2012)), all imaging datasets collected through the HBN are being made available to users regardless of data quality. This decision is justified by a lack of consensus in the imaging community on what constitutes “good” or “poor” quality data. Also, “lower quality” datasets can facilitate the development of artifact correction techniques and the evaluation of the impact of such real-world confounds on reliability and reproducibility. Given the range of clinical presentations in the HBN, the inclusion of datasets of varying qualities creates a unique opportunity to test for associations with participant-related variables of interest beyond age and hyperactivity (e.g., anxiety, autistic traits).
We used MRIQC to assess the quality of MRI scans by extracting image quality metrics (IQMs) from structural (T1w, T2w) and functional (BOLD) data. The graphs linked below show these measures for anatomical (T1, T2) and functional scans at each site. The MRIQC data can also be downloaded below.

View Graphs

Please click here to view anatomical T1 graphs for each site

Please click here to view anatomical T2 graphs for each site

Please click here to view functional graphs for each site

Download MRI MRIQC Data

Please click here for anatomical data

Please click here for functional data

Data License

HBN datasets are distributed under the Creative Commons BY 4.0 License, which allows for commercial use of datasets. However, the phenotypic, MRI, EEG, and eye-tracking datasets of a subset of participants are distributed under the Creative Commons Attribution Non-Commercial Share Alike License, which does not allow for commercial use of datasets. To identify participants whose data cannot be used for commercial purposes, review the column "Commercial_Use" in the Metadata File above, where "No" indicates "no commercial use allowed".
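The license check can be automated along the following lines. This is only a sketch: the "Commercial_Use" column and its "No" value come from the description above, while the `participant_id` column, the "Yes" value, the sample IDs, and the handling of blank cells are assumptions made for illustration.

```python
import csv
import io

# Hypothetical excerpt of the Metadata File. Only the Commercial_Use column
# and its "No" value are documented above; everything else is assumed.
SAMPLE_METADATA = """participant_id,Commercial_Use
sub-0001,Yes
sub-0002,No
sub-0003,Yes
sub-0004,
"""


def split_by_commercial_use(rows):
    """Partition participant IDs by the Commercial_Use flag.

    "No" means commercial use is not allowed. Blank values are kept in a
    separate list so they can be reviewed by hand rather than silently
    treated as permitted (an assumption of this sketch).
    """
    allowed, restricted, unknown = [], [], []
    for row in rows:
        value = (row.get("Commercial_Use") or "").strip().lower()
        if value == "no":
            restricted.append(row["participant_id"])
        elif value == "":
            unknown.append(row["participant_id"])
        else:
            allowed.append(row["participant_id"])
    return allowed, restricted, unknown


rows = list(csv.DictReader(io.StringIO(SAMPLE_METADATA)))
allowed, restricted, unknown = split_by_commercial_use(rows)
print("allowed:", allowed)
print("restricted:", restricted)
print("needs review:", unknown)
```

Keeping blank entries in their own list is a deliberately conservative choice: it avoids assuming that a missing flag implies permission.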

Notes

Data losses

Some participants may not be able to successfully complete all components of the HBN protocol due to a variety of factors (e.g., participants experiencing claustrophobia may not be able to stay in the scanner for the full session, a participant with sensory issues may have a more limited ability to participate in the EEG protocol). To prevent data loss when possible, we include exposure procedures such as a mock MRI scanner experience during session 1, and repeated exposures to an EEG cap prior to session 4. Overall, we attempt to collect as much of the data as possible within the allotted data collection intervals and code data losses when they occur.

Handling Head Motion in MRI Data

Head motion presents an unavoidable challenge for developmental and clinical imaging, regardless of MRI modality (fMRI, dMRI, sMRI). Arguably, the most basic strategy for handling motion, short of applying an uncomfortable motion-restricting apparatus, is limiting analyses to high-quality datasets. The Brain Genomics Superstruct Project data release is an excellent example of the utility of large-scale datasets in supporting such a strategy, as 1570 datasets were selected for analyses from a pool of 3000 individuals following rigorous quality control (Holmes et al. 2015). A limitation of this strategy for psychiatric data is that many phenotypes of interest are inherently more prone to head motion (e.g., children under 9, those with Attention-Deficit/Hyperactivity Disorder), especially those with higher levels of symptomatology. Compounding the downsides of discarding data are the increased costs associated with the recruitment and phenotyping of clinical populations.

For functional MRI, an alternative strategy is to statistically correct the data for movement-induced intensity fluctuations, or remove offending time frames altogether (Power et al. 2015). This can be accomplished by a number of means: regressing a model of movement from the data (e.g., spike regression (Satterthwaite et al. 2013)), removing the contributions of motion-related spatial patterns from the data (AROMA (Pruim et al. 2015)), attenuating motion spikes using a squashing function, removing offending frames, zeroing out offending frames, or deleting offending frames followed by interpolation. More generalized correction approaches, such as global signal regression and forms of white matter and cerebrospinal fluid regression (e.g., tCompCor, aCompCor (Behzadi et al. 2007; Chai et al. 2012)), can also help to account for motion artifacts. While there is no consensus approach to date, there is a growing literature focused on providing benchmark evaluations of these approaches, as well as their relative merits and weaknesses (e.g., see Ciric et al. 2017; Yan et al. 2013), that can be used to help select among these corrections.
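Two of the options above, removing offending frames and spike regression, both start from a per-frame motion estimate. The sketch below computes framewise displacement (FD) from six rigid-body parameters, flags high-motion frames, and builds one indicator regressor per flagged frame. The 50 mm head radius and 0.5 mm threshold are commonly used conventions chosen here as assumptions, not HBN recommendations, and the motion values are synthetic.

```python
HEAD_RADIUS_MM = 50.0    # assumed radius used to convert rotations to mm
FD_THRESHOLD_MM = 0.5    # assumed cutoff; tune for your own analysis


def framewise_displacement(motion, radius_mm=HEAD_RADIUS_MM):
    """Sum of absolute frame-to-frame changes in the six motion parameters.

    Each frame is (tx, ty, tz, rx, ry, rz): translations in mm and
    rotations in radians. The first frame has no predecessor, so FD = 0.
    """
    fd = [0.0]
    for prev, curr in zip(motion, motion[1:]):
        translation = sum(abs(c - p) for c, p in zip(curr[:3], prev[:3]))
        rotation = sum(abs(c - p) * radius_mm for c, p in zip(curr[3:], prev[3:]))
        fd.append(translation + rotation)
    return fd


def flag_frames(fd, threshold_mm=FD_THRESHOLD_MM):
    """Indices of frames whose FD exceeds the threshold."""
    return [i for i, value in enumerate(fd) if value > threshold_mm]


def spike_regressors(n_frames, flagged):
    """One indicator column per flagged frame (1 at that frame, 0 elsewhere)."""
    return [[1 if t == f else 0 for t in range(n_frames)] for f in flagged]


# Synthetic motion parameters for five frames (illustrative values only).
motion = [
    (0.0, 0.0, 0.0, 0.00, 0.0, 0.0),
    (0.1, 0.0, 0.0, 0.00, 0.0, 0.0),
    (0.1, 0.6, 0.0, 0.00, 0.0, 0.0),
    (0.1, 0.6, 0.0, 0.02, 0.0, 0.0),
    (0.1, 0.6, 0.0, 0.02, 0.0, 0.0),
]

fd = framewise_displacement(motion)
flagged = flag_frames(fd)
kept = [i for i in range(len(motion)) if i not in flagged]
regressors = spike_regressors(len(motion), flagged)
print("flagged frames:", flagged)
print("kept frames:", kept)
```

The `kept` indices correspond to the frame-removal approach, while `regressors` would be added as nuisance columns to a first-level design matrix for spike regression. Which of the two is preferable depends on the analysis, as the benchmark literature cited above discusses.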

More broadly, group-level statistical corrections can be used to account for the contributions of motion-related artifacts to associations revealed through data analysis (Satterthwaite et al. 2013). In the case of functional MRI, this can be accomplished by including motion parameters as a statistical covariate at the group level. Given the trait nature of head motion (Zuo et al. 2014), some have advocated for using fMRI-derived motion parameters in structural analysis as well. Alternatively, accounting for full-brain differences in measures of interest at the group level has been shown to be a potentially valuable approach to minimizing the deleterious effects of motion, particularly for fMRI (Yan et al. 2013).
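To make the covariate idea concrete, the sketch below compares a naive association between a symptom score and an imaging measure with the same association after adjusting for each participant's mean framewise displacement. It residualizes both variables on motion and regresses the residuals on each other, which gives the same coefficient as a multiple regression that includes motion as a covariate. All names and values are synthetic assumptions; a real analysis would normally use a statistics package and include further covariates.

```python
def mean(values):
    return sum(values) / len(values)


def slope_intercept(x, y):
    """Ordinary least squares fit of y = intercept + slope * x."""
    mx, my = mean(x), mean(y)
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, my - slope * mx


def residuals(x, y):
    """Part of y that is not linearly explained by x."""
    slope, intercept = slope_intercept(x, y)
    return [yi - (intercept + slope * xi) for xi, yi in zip(x, y)]


def motion_adjusted_slope(predictor, outcome, mean_fd):
    """Effect of predictor on outcome with mean FD as a covariate."""
    predictor_resid = residuals(mean_fd, predictor)
    outcome_resid = residuals(mean_fd, outcome)
    slope, _ = slope_intercept(predictor_resid, outcome_resid)
    return slope


# Synthetic group: motion rises with symptom score, and the imaging
# measure depends on both (true symptom effect = 2, motion effect = 3).
symptom_score = [0, 1, 2, 3, 4, 5]
mean_fd = [0.10, 0.30, 0.20, 0.50, 0.40, 0.60]
imaging_measure = [1 + 2 * s + 3 * m for s, m in zip(symptom_score, mean_fd)]

naive, _ = slope_intercept(symptom_score, imaging_measure)
adjusted = motion_adjusted_slope(symptom_score, imaging_measure, mean_fd)
print("naive slope:", round(naive, 3))
print("motion-adjusted slope:", round(adjusted, 3))
```

In this synthetic example the naive estimate is inflated because motion is correlated with the symptom score, while the adjusted estimate recovers the effect that was built into the data.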

It is our hope that the breadth of the Healthy Brain Network dataset will provide a practical perspective on the challenges of motion for various domains of illness and help to stimulate continued development and testing of novel correction strategies.